Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
3283 | 18 | 10,000 |
5463 | 11 | 2,000 |
6566 | 9 | 30,000 |
6568 | 9 | 5,000 |
8224 | 7 | 100,000 |
8243 | 7 | 25,000 |
8244 | 7 | 3,000 |
8250 | 7 | 4,000 |
9426 | 6 | 20,000 |
9428 | 6 | 35,000 |

Words containing (:

Rank in Wordlist | Frequency | Word |
---|---|---|
14013 | 4 | SU(3 |
25118 | 2 | SU(2 |
25479 | 2 | U(1 |
34990 | 1 | 1992)(ISBN |
35251 | 1 | 3)×SU(2)×U(1 |
38123 | 1 | Cup(ASCC |
42212 | 1 | Ingliż(Lingwa |
42213 | 1 | Inglott(1987 |
44510 | 1 | M(x |
44667 | 1 | Maltija(1994 |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
31669 | 2 | pre)Thing |
34168 | 1 | %) |
34753 | 1 | 1846-1935)li |
34754 | 1 | 1847),suldat |
34990 | 1 | 1992)(ISBN |
35041 | 1 | 2-1),żewġ |
35251 | 1 | 3)×SU(2)×U(1 |
35356 | 1 | 354-430)li |
35873 | 1 | 955)u |
37518 | 1 | C.E.). |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
4642 | 13 | 50% |
7289 | 8 | 10% |
7308 | 8 | 30% |
8251 | 7 | 40% |
8257 | 7 | 60% |
9433 | 6 | 70% |
11070 | 5 | 75% |
11071 | 5 | 90% |
13236 | 4 | 14% |
13282 | 4 | 55% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
29819 | 2 | l-R&B |
36657 | 1 | B&W |
44506 | 1 | M&E |
44507 | 1 | M&M's |
77729 | 1 | tal-A&R |
78401 | 1 | tal-R&B |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
34150 | 1 | $0.01 |
34151 | 1 | $13,800 |
34152 | 1 | $132 |
34153 | 1 | $15.6 |
34154 | 1 | $16.5 |
34155 | 1 | $200 |
34156 | 1 | $22,7 |
34157 | 1 | $250,000 |
34158 | 1 | $3,000 |
34159 | 1 | $5 |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
193 | 226 | b'hekk |
291 | 165 | b'mod |
377 | 128 | f'Malta |
381 | 126 | f'dan |
402 | 121 | darb'oħra |
427 | 115 | t'Isfel |
529 | 96 | f'din |
751 | 70 | f'dak |
823 | 64 | F'dan |
844 | 63 | f'kull |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
7239 | 9 | u/jew |
16553 | 3 | 120/80 |
16606 | 3 | 2005/06 |
17751 | 3 | Palestina/Erets |
19116 | 3 | http://rsssf |
22133 | 2 | 1/2 |
22134 | 2 | 1/4 |
22137 | 2 | 100/6 |
22272 | 2 | 2006/07 |
22274 | 2 | 2007/2008 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing a forbidden character do not have very low frequencies, there may be a problem in preprocessing.
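
The check described above can be sketched in a few lines of Python. This is only an illustration: the function name `flag_suspicious`, the threshold `min_freq=5`, and the choice to treat the apostrophe as an allowed character are assumptions made for the example, not settings taken from this report. The sample entries are copied from the tables in this section.

```python
# Special characters examined in this subsection.
SPECIAL_CHARS = ",()%&$\"'+*=/_"

def flag_suspicious(entries, allowed, min_freq):
    """Return (word, freq, offending_chars) for every word that contains a
    special character outside `allowed` and occurs at least `min_freq` times."""
    flagged = []
    for word, freq in entries:
        offending = sorted(
            {c for c in word if c in SPECIAL_CHARS and c not in allowed}
        )
        if offending and freq >= min_freq:
            flagged.append((word, freq, "".join(offending)))
    return flagged

# (word, frequency) pairs taken from the tables in this section.
sample = [
    ("b'hekk", 226),
    ("50%", 13),
    ("u/jew", 9),
    ("pre)Thing", 2),
    ("1847),suldat", 1),
]

# Assumption for illustration: the apostrophe is allowed inside words,
# and a frequency of 5 or more counts as "not very low".
flagged = flag_suspicious(sample, allowed={"'"}, min_freq=5)
for word, freq, chars in flagged:
    print(f"{word}\t{freq}\tcontains {chars}")
```

Words such as `pre)Thing` and `1847),suldat` stay below the threshold and are therefore not reported; whether a reported word points to a real preprocessing problem still has to be judged by hand.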
Query for words containing % (the backslash makes LIKE treat the % as a literal character instead of a wildcard):
select w_id-100, freq, word from words where w_id>100 and word like "%\%%" limit 10;
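
The same query pattern can be applied to each special character in turn. The following is a minimal sketch using Python's built-in `sqlite3` module with an in-memory database, not the database behind this report. The helper names `like_pattern` and `words_containing` are hypothetical, the table layout `words(w_id, freq, word)` is inferred from the query above, and the `ORDER BY` clause is an addition to make the output deterministic. SQLite has no default escape character for LIKE, so the sketch states one explicitly with `ESCAPE`.

```python
import sqlite3

def like_pattern(char, escape="\\"):
    """Build a LIKE pattern matching any word that contains `char` literally.
    % and _ are LIKE wildcards and must be escaped, as must the escape itself."""
    if char in ("%", "_", escape):
        char = escape + char
    return f"%{char}%"

def words_containing(conn, char, limit=10, offset=100):
    """Return (rank, freq, word) rows for words containing `char`.
    `offset` mirrors the w_id-100 convention of the query above."""
    sql = (
        "SELECT w_id - ?, freq, word FROM words "
        "WHERE w_id > ? AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (offset, offset, like_pattern(char), limit)).fetchall()

# Toy database; the rows reuse a few entries from the tables in this section,
# with w_id set to rank + 100.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, freq INTEGER, word TEXT)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [(293, 226, "b'hekk"), (4742, 13, "50%"), (7339, 9, "u/jew"), (7389, 8, "10%")],
)

percent_rows = words_containing(conn, "%")
for rank, freq, word in percent_rows:
    print(f"{rank} | {freq} | {word}")
```

Passing the pattern as a bound parameter avoids quoting problems for the characters `"` and `'`, and escaping `_` matters in practice: an unescaped `_` matches any single character, so the query would return nearly every word.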
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots